Under consideration for publication in Theory and Practice of Logic Programming 



1 



Abductive Logic Programs with Penalization: 
Semantics, Complexity and Implementation 



SIMONA PERRI 

Department of Mathematics, University of Calabria 
87030 Rende (CS), Italy 
{e-mail: perri@mat . unical . it) 



FRANCESCO SCARCELLO 

DEIS, University of Calabria 
87030 Rende (CS), Italy 
{e-mail: scarcello@deis . unical . it) 



NICOLA LEONE 

Department of Mathematics, University of Calabria 
87030 Rende (CS), Italy 
{e-mail: leone@mat . unical . it) 



submitted 28 Januari 2003; revised 16 October 2003; accepted 21 October 2003 



Abstract 

Abduction, first proposed in tiie setting of classical logics, has been studied with growing interest in 
the logic programming area during the last years. 

In this paper we study abduction with penalization in the logic programming framework. This 
form of abductive reasoning, which has not been previously analyzed in logic programming, turns 
out to represent several relevant problems, including optimization problems, very naturally. We de- 
fine a formal model for abduction with penalization over logic programs, which extends the abduc- 
tive framework proposed by Kakas and Mancarella. We address knowledge representation issues, 
encoding a number of problems in our abductive framework. In particular, we consider some rel- 
evant problems, taken from different domains, ranging from optimization theory to diagnosis and 
planning; their encodings turn out to be simple and elegant in our formalism. We thoroughly analyze 
the computational complexity of the main problems arising in the context of abduction with penal- 
ization from logic programs. Finally, we implement a system supporting the proposed abductive 
framework on top of the DLV engine. To this end, we design a translation from abduction problems 
with penalties into logic programs with weak constraints. We prove that this approach is sound and 
complete. 

KEYWORDS: Knowledge Representation, Nonmonotonic Reasoning, Abduction, Logic Programs, 
Computational Complexity, Stable Models, Optimization Problems, Penalization 
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1 Introduction 

Abduction is an important form of reasoning, first studied in depth by Peirce flQSSV Given 
the observation of some facts, abduction aims at concluding the presence of other facts, 
from which, together with an underlying theory, the observed facts can be explained, i.e., 
deductively derived. Thus, roughly speaking, abduction amounts to an inverse of modus 
ponens. 

For example, medical diagnosis is a typical abductive reasoning process: from the symp- 
toms and the medical knowledge, a diagnosis about a possible disease is abduced. Notice 
that this form of reasoning is not sound (a diagnosis may turn out to be wrong), and that 
in general several abductive explanations (i.e., diagnoses) for the observed symptoms may 
be possible. 

It has been recognized that abduction is an important principle of common-sense reason- 
ing, and that abduction has fruitful applications in a number of areas such diverse as model- 
based diagnosis (Poole 1989 ) . speech recognition ( Hobbs and M. E. Stickel 1993 1, model 
checking JBuccafurri et al. 1999> . maintenance of database views dKakas and Mancarella 1990al . 
and vision dCharniak and McDermott 1985> . 

Most research on abduction concerned abduction from classical logic theories. How- 
ever, there are several application domains where the use of logic programming to perform 
abductive reasoning seems more appropriate and natural dEiter et al. 1997> . 

For instance, consider the following scenario. Assume that it is Sunday and is known 
that Fabrizio plays soccer on Sundays if it's not raining. This may be represented by the 
following theory T: 

play .soccer <— is^sunday A not rains issunday ^ 

Now you observe that Fabrizio is not out playing soccer (rather, he is writing a paper). 
Intuitively, from this observation we conclude that it rains (i.e, we abduce rains), for 
otherwise Fabrizio would be out playing soccer. Nevertheless, under classical inference, the 
fact rains is not an explanation of not play _soccer , as T U {rains^ ^ not play^soccer 
(neither can one find any explanation). On the contrary, if we adopt the semantics of logic 
programming (interpreting not as the nonmonotonic negation operator), then, according 
with the intuition, we obtain that rains is an explanation of not play.soccer, as it is 
entailed by T U {rains}. 

In the context of logic programming, abduction has been first proposed by Kakas and 
Mancarella (I1990b> and, during the recent years, the interest in this subject has been grow- 
ing rapidly (Console et al. 199r KonoHge 1992 Kakas et al. 1992 Dung 1991 Denecker and De Sc hreye 1995) 
ISakama and Inoue 2000 .Brena 1998 Kakas et al. 2000. Denecker and Kakas 2002, Lin and You 20021 . 
This is also due to some advantages in deahng with incomplete information that this kind of 
reasoning has over deduction ( [Denecker and De Schreye 1995|IBaral and Gelfond 1994t . 

Unlike most of these previous works on abduction in the logic programming framework, 
in this paper we study abduction with penalization from logic programs. This form of 
abductive reasoning, well studied in the setting of classical logics JEiter and Gottlob 1995t . 
has not been previously analyzed in logic programming. 

Note that dealing with weights or penalties has been recognized as a very important 
feature of knowledge representation systems. In fact, even at the very recent Workshop on 



Abductive Logic Programs with Penalization 



3 



Nonmonotonic Reasoning, Answer Set Programming and Constraints (Dagstuhl, Germany, 
2002), many talks and system demonstrations pointed out that a lot of problems arising in 
real applications requires the ability to discriminate over different candidate solutions, by 
means of some suitable preference relationship. Note that this is not just an esthetic issue, 
for representing such problems in a more natural and declarative way. Rather, a proper 
use of preferences may have a dramatic impact even on the efficiency of solving these 
problems. 

In this paper, we define a formal model for abduction with penalization from logic pro- 
grams, which extends the abductive framework proposed by Kakas and Mancarella J1990bt . 
Roughly, a problem of abduction with penalization V consists of a logic program P, a set 
of hypotheses, a set of observations, and a function that assigns a penalty to each hypothe- 
sis. An admissible solution is a set of hypotheses such that all observations can be derived 
from P assuming that these hypotheses are true. Each solution is weighted by the sum of 
the penalties associated with its hypotheses. The optimal solutions are those with the min- 
imum weight, which are considered more likely to occur, and thus are preferred over other 
solutions with higher penalties. 

We face knowledge representation issues, by showing how abduction with penaliza- 
tion from logic programming can be used for encoding easily and in a natural way relevant 
problems belonging to different domains. In particular, we consider the classical Travelling 
Salesman Problem from optimization theory, (a new version of) the Strategic Companies 
Problem, the planning problem Blocks World, from artificial intelligence. It is worthwhile 
noting that these problems cannot be encoded at all in (function-free) normal logic pro- 
gramming, even under the powerful stable model semantics. 

We analyze the computational complexity of the main problems arising in this frame- 
work, namely, given a problem V of abduction with penalization over logic programs, 

• decide whether V is consistent, i.e., there exists a solution for V; 

• decide whether a given set of hypotheses is an admissible solution for V; 

• decide whether a given set of hypotheses is an optimal solution for V; 

• decide whether a given hypothesis h is relevant for V, i.e., h occurs in some optimal 
solution of V; 

• decide whether a given hypothesis h is necessary for V, i.e., h is contained in all 
optimal solutions of V; 

• compute an optimal solution of V. 

The table in Figure shows the complexity of all these problems, both in the general 
case and in the restricted setting where the use of unstratified negation is forbidden in 
the logic program of the abduction problem. Note that a complexity class C in any entry 
of this table means that the corresponding problem is C-complete, that is, we prove both 
membership and hardness of the problem for the complexity class C. 

An interesting result in this course is that "negation comes for free" in most cases. That 
is, the addition of negation does not cause any further increase to the complexity of the 
main abductive reasoning tasks (which remains the same as for not-free programs). Thus, 
the user can enjoy the knowledge representation power of nonmonotonic negation without 
paying additional costs in terms of computational overhead. More precisely, it turns out 
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that abduction with penalization over general logic programs has exactly the same com- 
plexity as abduction with penalization over definite Horn theories of classical logics in the 
three main computational abductive-reasoning tasks (deciding relevancy and necessity of 
an hypothesis, and computing an optimal solution). While unstratified negation brings a 
relevant complexity gap in deductive reasoning (from V to NP for brave reasoning), in 
this case, the use of negation does not lead to any increase in the complexity, as shown in 
Figure [J 
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Fig. 1. Overview of the Complexity Results 



We have implemented the proposed framework for abduction with penalization over 
logic programs as a front-end for the DLV system. Our implementation is based on an al- 
gorithm that translates an abduction problem with penalties into a logic program with weak 
constraints JBuccafurri et al. 2000> . which is then evaluated by DLV. We prove that our ap- 
proach is sound and complete. Our abductive system is available in the current release of 
the DLV system (www . dlvsystem . com), and can be freely retrieved for experiments. 
It is worthwhile noting that our rewriting approach can be adapted for other ASP systems 
with suitable constructs for dealing with weighted preferences. For instance, our algorithm 
can be modified easily in order to compute programs with weight literals to be evaluated 
by the Smodels system (Simons et al. 2002 1. 

In sum, the main contribution of the paper is the following. 

• We define a formal model of abduction with penalization over logic programs. 

• We carry out a thorough analysis of the complexity of the main computational prob- 
lems arising in the context of abduction with penahzation over logic programs. 

• We address knowledge representation issues, showing how some relevant problems 
can be encoded in our framework in a simple and fully declarative way. 

• We provide an implementation of the proposed abductive framework on top of the 
DLV system. 

Our work is evidently related to previous studies on semantic and knowledge represen- 
tation aspects of abduction over logic programs. In Section^ we discuss the relationships 
of this paper with such previous studies and with some further related issues. 

The rest of the paper is organized as follows. In Section |2l we recall the syntax of 
(function-free) logic programs and the stable model semantics. In Section |3l we define 
our model of abduction with penalization from logic programs, and in Section |3 we give 
some examples of applications of this form of abduction in different domains. In Section 
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|5] we analyze the computational complexity of the main problems arising in this frame- 
work. In Section|6j we describe our prototype that implements abduction with penalization 
from logic programs, and makes it available as a front end of the system DLV. Section0is 
devoted to related works. Finally, in Section|8] we draw our conclusions. 

2 Preliminaries on Logic Programming 

We next give the syntax of function-free logic programs, possibly containing nonmono- 
tonic negation (negation as failure) and constraints. Then, we recall the stable model se- 
mantics JGelfond and Lifschitz 19881 for such logic programs. 

2.1 Syntax 

A term is either a constant or a variable^. An atom has the form a(ti, t^), where a is 
a predicate of arity n and ti, tn are terms. A literal is either a positive literal a or a 
negative literal not a, where a is an atom. 
A rule r has the form 

a :— bi, . . . , bit, not bk+i, ■ • ■ ,iiot b^. k > 0, m > k 

where a, bi, . . . , bj, are atoms. 

Atom ais the /leoiiof r, while the conjunction bi, . . . ,bk,not b^+i, . . . ,not 'b^isXhtbody 
of r. We denote by H{r) the head atom a, and by B{r) the set {bi, . . . , b^, not b^+i, . . . , 
not b„} of the body literals. Moreover, B^{r) and B^{r) denote the set of positive and 
negative literals occurring in B{r), respectively. If B{r) — 0, i.e., m = 0, then r is a. fact. 

A strong constraint (integrity constraint) has the form Li, .... L^., where each Li, 
1 < i < m , is a literal; thus, a strong constraint is a rule with empty head. 

A (logic) program P is a finite set of rules and constraints. A negation-free program 
is called positive program. A positive program where no strong constraint occurs is a 
constraint-free program. 

A term, an atom, a literal, a rule or a program is ground if no variable appears in it. A 
ground program is also called a propositional program. 

2.2 Stable model semantics 

Let P be a program. The Herbrand Universe Up of Pis the set of all constants appearing in 
P. The Herbrand Base Bp of P is the set of all possible ground atoms constructible from 
the predicates appearing in the rules of P and the constants occurring in Up (clearly, both 
Up and Bp are finite). Given a rule r occurring in a program P, a ground instance of r is 
a rule obtained from r by replacing every variable X in r by (j{X), where cr is a mapping 
from the variables occurring in r to the constants in Up. We denote by ground{P) the 
(finite) set of all the ground instances of the rules occurring in P. An interpretation for P 
is a subset / of Bp (i.e., it is a set of ground atoms). A positive literal a (resp. a negative 
literal not a) is true with respect to an interpretation / if a G / (resp. a ^ /); otherwise it 

^ Note that function symbols are not considered in tliis paper. 
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is false. A ground rule r is satisfied (or true) w.r.t. / if its head is true w.rt. / or its body is 
false w.rt. /. 

A model for P is an interpretation M for P such that every rule r e ground{V) is true 
w.r.t. M. If P is a positive program and has some model, then P has a (unique) least model 
(i.e., a model included in every model), denoted by lm{P). 

Given a logic program P and an interpretation I, the Gelfond-Lifschitz transformation of 
P with respect to / is the logic program P^ consisting of all rules a bi, . . . , b^ such that 
(1) a :— bi, . . . , bit, not bk+i, ■ • • ,not bj, £ P and (2) bi ^ /, for all k < i < m. 
Notice that not does not occur in P^ , i.e., it is a positive program. 

An interpretation / is a stable model of P if it is the least model of its Gelfond-Lifschitz 
w.r.t. /, i.e., if / = lm{P^) dGelfond and Lifschitz 1988t . The collection of all stable 
models of P is denoted by SM{P) (i.e., SM{P) = {I\I = lm{P^)}). 

Example 2.1 

Consider the following (ground) program P: 

a :— not b . b :— not a . c :— a . c :— b . 

The stable models of P are Mi = {a, c} and M2 — {b, c}. Indeed, by definition of 
Gelfond-Lifschitz transformation, 

P^^i = { a :-, c ;-a, c :-b } and P*^^ = { b c :-a, c :-b } 

and it can be immediately recognized that lm{P^^^ ) = Mi and lm{P'^^^ ) = M2. 

We say that an atom p depends on an atom q if there is a rule r in P such that p ~ H{r) 
and either q e P+(r) or not q G B~{r). Let < denote the transitive closure of this 
dependency relationship. The program P is a recursive program if there are p, q e Bp 
such that p < q and q < p. We say that P is unstratified, or that unstratified negation 
occurs in P, if there is a rule r in P such that p = H{r), not q G B^{r), and q < P- ^ 
program where no unstratified negation occurs is called stratified. 

Observe that every stratified program P has at most one stable model. The existence 
of a stable model is guaranteed if no strong constraint occurs in the stratified program 
P. Moreover, every stratified program can be evaluated in polynomial time. In particular, 
deciding whether there is a stable model, computing such a model, or deciding whether 
some literal is entailed (either bravely or cautiously) by the program are all polynomial- 
time feasible tasks. 

For a set of atoms X, we denote by facts{X) the set of facts {p. \ p ^ X}. Clearly, for 
any program P and set of atoms S, all stable models of P U facts (S) include the atoms in 
S. 

3 A Model of Abduction with Penalization 

First, we give the formal definition of a problem of abduction from logic programs under 
the stable model semantics, and we provide an example on network diagnosis, that we use 
as a running example throughout the paper Then, we extend this framework by introducing 
the notion of penaHzation. 
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Fig. 2. Computer network M in Example |3.2l 
Definition 3.1 

(Abduction From Logic Programs) 

A problem of abduction from logic programs V is a. triple {H, P, O), where H is a finite 
set of ground atoms called hypotheses, P is a logic program whose rules do not contain 
any hypothesis in their heads, and O is a finite set of ground literals, called observations, 
or manifestations. 

A set of hypotheses S" C _ff is an admissible solution (or explanation) to V if there exists 
a stable model M of P U facts{S) such that, \/o <E 0,o is true w.rt. M. 

The set of all admissible solutions to V is denoted by Adm{V). | 

Example 3.2 

(Network Diagnosis) Suppose that we are working on machine a (and we therefore 
know that machine a is online) of the computer network A/^ in Figure^ but we observe ma- 
chine e is not reachable from a, even if we are aware that e is online. We would like to know 
which machines could be offline. This can be easily modelled in our abduction framework 
defining a problem of abduction Vi = {H, P, O), where the set of hypotheses is H = 
{offline(a), offline(b), of f line(c), of f line(d), offline(e), offline(f)}, the 
set of observations is O = {not of f liiie(a), not of f line(e), not reaclies(a, e)}, and 
the program P consists of the set of facts encoding the network, facts ({connected(X, Y) | 
{X, Y} is an edge of N}), and of the following rules: 

reaches(X,X) :— node(X),not offline(X). 

reaches(X,Z) :— reaches (X, Y), connected(Y, Z), not offline(Z). 

Note that the admissible solutions for Vi corresponds to the network configurations that 
may explain the observations in O. In this example, Adm{V) contains five solutions 



^1 

^3 
^5 



{offline(f), offline(b)}, 
{offline(f), offline(c), offline(d)}, 
{offline(f), offline(b), offline(c)}, 
{offline(f), offline(b), offline(d)}, 
{offline(f), offline(b), offline(c), offline(d)}. 
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Note that Definition 13 . 1 1 concerns only the logical properties of the hypotheses, and it 
does not take into account any kind of minimality criterion. We next define the problem 
of abduction with penalization, which allows us to make finer abductive reasonings, by 
expressing preferences on different sets of hypotheses, in order to single out the most 
plausible abductive explanations. 

Definition 3.3 

(Abduction With Penalization From Logic Programs) 

A problem of abduction with penalization (PAP) V isa tuple {H, P,0,j), where {H, P, O) 
is a problem of abduction, and 7 is a polynomial-time computable function from H to the 
set of non-negative reals (the penalty function). The set of admissible solutions for V is the 
same as the set of solutions of the embedded abduction problem (iJ, P, O), i.e., we define 
Adm(V) =Adm{{H,P,0)). 

For a set of atoms A, let sum^(yl) = TlheA 7(^)- Then, S is an (optimal) solution (or 
explanation) for Vif (i) S ^ Adm{V) wd(\\) sum^(S) < SMm^(S"), forall S" ^Adm{V). 

The set of all (optimal) solutions for V is denoted by Opt{V). | 

Example 3.4 

(Minimum-cardinality criterion) Consider again the network TV and the problem 
of abduction Vi = {H, P, O) in Example 13. 21 Again, we want to explain why the online 
machine e is not reachable from a. However, we do not consider any more plausible all the 
explanations provided by Vi. Rather, our domain knowledge suggests that it is unlikely 
that many machines are offline at the same time, and thus we are interested in explanations 
with the minimum number of offline machines. This problem is easily represented by the 
problem of abduction with penalization V2 = {H, P, 0, 7), where H, P and O are the 
same as in Vi, and, for each h E H, 7(^1) = 1. 

Indeed, consider the admissible solutions of V2 and observe that 

sum^(Si) — 2, sumj{S2) = sumj(S3) = sum^(S4) = 3, sumj{S<i) — 4 

It follows that 5*1 is the unique optimal explanation for P2, and in fact corresponds to the 
unique solution of our diagnosis problem with a minimum number of offline machines. 

The following properties of a hypothesis in a PAP V are of natural interest with respect 
to computing abductive solutions. 

Definition 3.5 

Let V = {H, P, 0, 7) be a PAP and h e H. Then, h is relevant for Pit h e S for some 
S G Opt{V), and h is necessary for V if h G S for every S G Opt(V). 

Example 3.6 

In example [3. 41 of f line(b), and of f line(f ) are the relevant hypotheses; they are also 
necessary since Si is the only optimal solution. 

4 Knowledge Representation 

In this section, we show how abduction with penalization from logic programming can be 
used for encoding easily and in a natural way relevant problems from different domains. 
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A nice discussion of how abduction can be used for representing knowledge declara- 
tively can be found in ( Denecker and Kakas 2002 1, where this setting is also related to 
other nonmonotonic reasoning paradigms. It also recalled that abduction has been defined 
broadly as any form of "inference to the best explanation" ( |Josephson and S.G. Josephson 1994) , 
where best refers to the fact that usually hypotheses can be compared according to some 
criterion. 

In our framework, this optimality criterion is the sum of the penalties associated to the 
hypotheses, which has to be minimized. 

In particular, in order to represent a problem, we have to identify: 

• the hypotheses, that represent all the possible entities that are candidates for belong- 
ing to solutions; 

• for each hypothesis h, the penalty associated to h, that represents the cost of includ- 
ing /i in a solution; 

• the logic program P, that encodes a representation of the reality of interest and, in 
particular, of the way any given set of hypotheses changes this reality and leads to 
some consequences; 

• the observations, or manifestations, that are distinguished logical consequences, of- 
ten encoding some desiderata. For any given set of hypotheses H, the fact that these 
observations are consequences of the logic program (plus H) witnesses that H is a 
"good" set of hypotheses, i.e., it encodes a feasible solution for the problem at hand. 

For instance, in the network diagnosis problem described in Example l3.4l the hypotheses 
are the possible offline machines and the logic program P is able to determine, for any 
given set of offline machines encoding a network status, which machines are unreachable. 
In this case, and usually in diagnosis problems, these observations are in fact pictures of 
the reality of interest: we see that some machines are not reachable in the network and 
that some machines are not offline, and we would like to infer, via abductive reasoning, 
what are the explanations for such a situation. Moreover, in this example, we are interested 
only in solutions that consist of the minimum number of offline machines, leading to the 
observed network status, because they are believed more likely to occur. This is obtained 
easily, by assigning a unitary penalty to each hypothesis. 

We next show the encodings of other kind of problems that can be represented in a 
natural way trough abduction with penalization from logic programs, even though they are 
quite different from the above simple cause-effect scheme. 

For the sake of presentation, we assume in this section that logic programs are equipped 
with the built-in predicates j^, <, >, and +, with the usual meaning. Clearly, for any 
given program P, these predicates may be encoded by suitable finite sets of facts, be- 
cause we have to deal only with the (finite) set of constants actually occurring in P. 
Moreover, observe that most available systems for evaluating logic programs - e.g., DLV 
JEiter et al. 1998HLeone et al. 2002> and smodels ( iNiemela and Simons 1997ilSrmons et al. 2002b 
- provide in fact such operators. 

4.1 The Travelling Salesman Problem 

An instance / of the Travelling Salesman Problem (TSP) consists of a number of cities 
ci, . . . , c„, and a function w that assigns to any pair of cities a, cj a positive integer value. 
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which represents the cost of travelling from Ci to Cj . A solution to / is a round trip that visits 
all cities in sequence and has minimal travelling cost, i.e., a permutation r of 1, . . . , n such 
that the overall cost 

n-l 

w{t) — w{T{i), T{i + 1)) + w{T{n), t(1)) 

4=1 

is minimum. 

Let us see how we can represent this problem in our framework. Intuitively, any solution 
consists of pair of cities encoding a tour of the salesman, while the observations must 
witness that this tour is correct, i.e., that all cities are visited exactly once. Thus, we have a 
hypothesis for each pair of cities Ci, Cj, because any such a pair is candidate for belonging 
to the trip of the salesman. The penalty associated to each hypothesis is clearly the cost 
of travelling from c,; to cj, because we want to obtain the minimum-cost tour. Moreover, 
for any given trip encoded by a set of hypotheses, the logic program determines the cities 
reached by the salesman, and also whether the salesman has travelled in a correct way. The 
observations are possible consequences of the program, which encode that all cities are 
visited and no visiting rule has been violated. 

Formally, we represent the TSP instance / as a PAP V ~ {H,P,0,j) defined as 
follows. The set of hypotheses is iJ = {c(i,j) | 1 < i,j < n}, where c(i,j) en- 
codes the fact that the salesman visits city j immediately after city i. The penalty function 
7(c(i,j)) = w(i,j) encodes the cost of travelling from i to j. The cities are encoded 
through a set of atoms {city(i) | 1 < i < n}. The program P contains the following 
rules: 



(1) city(i). 


for each i 


(2) visited(l) 


:— visited(j), c(J, I). 


(3) visited(l) 


:- c(J,l). 


(4) missedCity 


:— city(l), not visited(l). 


(5) badTour 


:- c(l, J), c(l,K), J ^ K. 


(6) badTour 


;- c(J, I), c(K, I), J 7^ K. 


The observations are O = 


{not missedCity, not badTour}. 



It is easy to see that every optimal solution S G OptiV) corresponds to an optimal tour 
and viceversa. The facts (1) of P encode the cities to be visited. Rule (2) states that a city 
i has been visited if the salesman goes to city i after an already visited city j. Rule (3) 
concerns the first city that, w.l.o.g., is the first and the last city of the tour. In particular, 
it is considered visited, if it is reached by some other city j, which is turn forced to be 
visited, by the other rule of P. Rule (4) says that there is a missed city if at least one of 
the cities has not been visited. Atom badTour, defined by rules (4) and (5), is true if some 
city is in two or more connection endpoints or connection startpoints. The observations 
not missedCity, not badTour enforce that admissible solutions correspond to salesman 
tours that are complete (no city is missed) and legal (no city is visited twice). 

Moreover, since optimal solutions minimize the sum of the connection costs, abductive 
solutions in Opt{V ) correspond one-to-one to the optimal tours. 

In liEiter et al. 1997t . Eiter, Gottlob, and Mannila show that Disjunctive Logic Program- 
ming (function-free logic programming with disjunction in the heads and negation in the 
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bodies of the rules) is highly expressive. Moreover, the authors strength the theoretical 
analysis of the expressiveness by proving that problems relevant in practice like, e.g., the 
Travelling Salesman Problem and Eigenvector, can be programmed in DLP, while they 
cannot be expressed by disjunction-free programs. Indeed, recall that computing an opti- 
mal tour is both NP-hard and co-NP-hard. Moreover, in p^apadimitriou 1984 ) it is shown 
that deciding whether the cost of an optimal tour is even, as well as deciding whether there 
exists a unique optimal tour, are Af -complete problems. Hence, it is not possible to ex- 
press this problem in disjunction-free logic programming, even if unstratified negation is 
allowed (unless the polynomial hierarchy collapses). 

Nevertheless, the logic programs implementing these problems in DLP highlight, in 
our opinion, a weakness of the language for the representation of optimization problems. 
The programs are very complex and tricky, the language does not provide a clean and 
declarative way to implement these problems.^ For a comparison, we report in Appendix 
[Appendix B| the encoding of this problem in (plain) DLP, as described in dEiter et al. 1997> . 
Evidently, abduction with penalization provides a simpler, more compact, and more ele- 
gant encoding of TSP. Moreover, note that, using this form of abduction, even normal 
(disjunction-free) programs are sufficient for encoding such optimization problems. 

4.2 Strategic Companies 

We present a new version of the strategic companies problem (ICadoli et al. 1997V A man- 
ager of a holding identifies a set of crucial goods, and she wants these goods to be produced 
by the companies controlled by her holding. In order to meet this goal, she can decide to 
buy some companies, that is to buy enough shares to get the full control of these compa- 
nies. Note that, in this scenario, each company may own some quantity of shares of another 
company. Thus, any company may be controlled either directly, if it is bought by the hold- 
ing, or indirectly, through the control over companies that own more than 50% of its shares. 
Of course, it is prescribed to minimize the quantity of money spent for achieving the goal, 
i.e., for buying new companies. 

For the sake of simplicity, we will assume that, if a company X can be controlled indi- 
rectly, than there are either one or two companies that together own more than 50% of the 
shares of X. Thus, controlling these companies is sufficient to take the control over X. 

We next describe a problem of abduction from logic programs with penalization V 
whose optimal solutions correspond to the optimal choices for the manager In this case, 
the observations are the crucial goods that we want to produce, while the hypotheses are 
the acquisitions of the holding and their associated penalties are the costs of making these 
financial operations. The logic program determines, for any given set of acquisitions, all 
the companies controlled by the holding and all the goods produced by these companies. 

Companies configurations are encoded by the set of atoms Market defined as follows: 
if a company y owns n% of the shares of a company x then share(x,y,n) belongs to 



^ We refer to standard Disjunctive Logic Programming h ere. As shown in IBuccafurri et al. 2000L the addition 
of weak constraints, implemented in the DLV system lEiter et al. 20001 . is another way to enhance DLP to 
naturally express optimization problems. 
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Market, and if a company x produces a good a then producedBy (a, x) belongs to Market. 
No more atoms belong to this set. 

Then, let V = {H,P,0^^), where the set of hypotheses H = {bought (xi), ... , 
bought (xn)} encodes the companies that can be bought, and the set of observations O = 
{produced(yi), . . . , produced(yn)} encodes the set of goods to be produced. Moreover, 
for each atom bought (x) G H, 7(bought(x)) is the cost of buying the company x. The 
program P consists of the facts encoding the state of the market facts{Market) and of the 
following rules: 



(1) produced(X) 

(2) controlled(X) 

(3) controlled(X) 

(4) controlled(X) 



producedBy (X, Y), controlled(Y). 
bought (X). 

share(X, Y, N), controlled(Y), N > 50. 
share(X, Y, N), share(X, Z, M), 

coiitrolled(Y), coiitrolled(Z), M + N > 50, Y 7^ Z. 



budweiser 









heineken 



candia 



Fig. 3. Strategic Companies 



Example 4.1 

Consider the following sets of companies and goods: 

Companies={ barilla, saiwa, frutto, panino, budweiser, heineken, parmalat, candia } 
Goods= { wine, pasta, beer, tomatoes, bread, milk}. 

Figure Is] depicts the relationships share and producedBy among companies, and among 
products and companies, respectively. A solid arrow from a company C to a good G rep- 
resents that G is produced by C. A dashed arrow from a company Ci to a company C2 
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labelled by n means that Ci owns n% of the shares of C2. The cost (in millions of dollars) 
for buying directly a company is shown below: 

barilla 500 saiwa 400 frutto 350 panino 150 
budweiser 300 heineken 300 parmalat 300 candia 150 

Accordingly, the hypotheses and their respective penalties are 

bought (barilla) 7(bought (barilla)) = 500 
bought (saiwa) 7(bought (saiwa)) = 400 

bought (candia) 7(bought (candia)) — 150. 
The set of observations is 

O ={produced(pasta), produced(wine), produced(tomatoes), 
produced(bread), produced(beer), produced(milk)} 

This problem has the only optimal solution Si={barilla, frutto, heineken^, whose cost 
is 1 150 millions of dollars. Note that all goods in G can be produced also by buying the set 
of companies 82= {barilla, frutto, saiwa}. However, since saiwa is more expensive than 
heineken. Si is preferred to S2. 

4.3 Blocks world with penalization 

Planning is another scenario where abduction proves to be useful in encoding hard prob- 
lems in an easy way. 

The topic of logic -based languages for planning has recently received a renewed great 
deal of interest, and many approaches based on answer set semantics, situation calculus, 
event calculus, and causal knowledge have been proposed — see, e.g., JGelfond and Lifschitz 19981 
Eiter et al . 2003 Shanahan 2000 Turner 1999 1. 

We consider here the Blocks World Problem dLifschitz 1999> : given a set of blocks B 
in some initial configuration Start, a desired final configuration Goal, and a maximum 
amount of time lastTime, find a sequence of moves leading the blocks from state Start, 
to state Goal within the prescribed time bound. Legal moves and configurations obey the 
following rules: A block can be either on the top of another block, or on the table. A block 
has at most one block over it, and is said to be clear if there is no block over it. At each 
step (time unit), one or more clear blocks can be moved either on the table, or on the top of 
other blocks. Note that in this version of the Blocks World Problem more than one move 
can be performed in parallel, at each step. Thus, we additionally require that a block Bi 
cannot be moved on a block B2 at time T if also B2 is moved at time T. 

Assume that we want to compute legal sequences of moves - also called plans - that 
leads to the desired final state within the lastTime bound and that consists of the minimum 
possible number of moves. We next describe a problem of abduction from logic programs 
with penalization V whose optimal solutions correspond to such good plans. In this case, 
the observations encode the desired final configuration, while the hypotheses correspond to 
all possible moves. Since we are interested in minimum-length plans, we assign a unitary 
penalty to each move (hypothesis). Finally, the logic program has to determine the state of 
the system after each move and detect possible illegal moves. 
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Consider an instance BWP of the Blocks World Problem. Let i? = {&i, • • • , 6„} be the 
set of blocks and L — B \J {table} the set of possible locations. 

The blocks of BWP are encoded by the set of atoms SZocfcs — {block(bi), . . . ,block(bn)}, 
and the initial configuration is encoded by a set Start containing atoms of the form oii(b, £, 0), 
meaning that, at time 0, the block b is on the location i. 

Then, let P = {H,P,0,j). The set of hypotheses H = {move(b, ^, t) \ b e B,£ e 
L,0 < t < lastTime} encodes all the possible moves, where an atom move(b, £, t) means 
that, at time i, the block b is moved to the location £. The set of observations O contains 
atoms of the form on(b, £, lastTime), encoding the final desired state Goal. The penalty 
function 7 assigns 1 to each atom move(b,^, t) e H. Moreover, P = facts{Blocks) U 
facts{Start) U R, where R is the following set of rules: 



Note that the strong constraints in R discards models encoding invalid states and illegal 
moves. For instance, ConstraintOsays that it is forbidden to move a block B, if B is not 
clear 

Rule \l\ says that moving a block _B on a location L at time T causes B to be on L at 
time T + 1. Rule|2]represents the inertia of blocks, as it asserts that all blocks that are not 
moved at some time T remain in the same position at time T + 1. 

It is worthwhile noting that expressing such inertia rules is an important issue in knowl- 
edge representation, and clearly shows the advantage of using logic programming, when 
nonmotononic negation is needed. 

For instance, observe that Rule|2lis very natural and intuitive, thanks to the use of nega- 
tion in literal not moved(B, T). However, it is not clear how to express this simple rule - 
and inertia rules in general - by using classical theories.^ 

Example 4.2 

Consider a Blocks World instance where the initial configuration and the final desired state 
are shown in figure|3 and the maximum number of allowed steps is 6. Therefore, the set 
of observations of our abduction problem is {on(a, table, 6), on(b, a, 6), on(c, b, 6), 
on(d, c, 6), on(e, d, 6), on(f , e, 6)}. The set of hypotheses contains all the possible 
moves, that is 

H = {move(a, table, 0), move(a, table, 1), • • • , move(f , d, 6), move(f , e, 6)} 

^ In fact, there are some solutio ns to this prob lem for in teresting sp ecial cas es, suc h as settings where all actions 
on all fluents can be specified IReiter 19911 . Also, in IMcCain a nd Turner 1997V it is defined a nonmonotonic 
formalism based on causal laws that is powerful enough to represent inertia rules (unlike previous approaches 
based on inference rules only). A comprehensive discussion of the frame problem can be found in the book 
iShanahan 1997j . 



on(B,L,Tl) 
oii(B,L,Tl) 
moved(B, T) 



move(B,L,T),Tl = T + 1. 

oii(B, L, T), Tl = T + 1, not moved(B, T). 

move(B, _, T). 

on(B, L, T), on(B, LI, T), L 7^ LI. 
on(Bl,B,T),on(B2,B,T),B2 ^ Bl,block(B) 
on(B,B,T). 

move(B, Bl, T), move(Bl, L, T). 
move(B, L, T), on(Bl, B, T), B 7^ Bl. 



(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
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Fig. 4. Blocks World 



Each move has cost 1. 

In this case, the minimum number of moves needed for reaching the final configu- 
ration is six. An optimal solution is {move(a, table, 0), move(b, a, 1), move(c, b, 2), 
inove(d, c, 3),inove(e, d, 4), move(f, e, 5)}. Note that the plan 

{move(a, table, O),move(c, table, O),move(b, a, 1), 
move(c, b, 2),move(d, c, 3),move(e, d, 4),move(f, e, 5)} 

though legal, is discarded by the minimality criterion, because it consists of seven moves. 

Finally, observe that the proposed framework of abduction from logic programs with 
penalties allows us to represent easily different plan-optimization strategies. For instance, 
assume that each block has a weight, and we want to minimize the total effort made for 
reaching the goal. Then, it is sufficient to modify the penalty function in the PAP V above 
as follows: for each hypothesis move(b, ^. t), let 7(move(b, ^. t)) — w, where w is the 
weight of the block b. 



5 Computational Complexity 

In this section, we study the computational complexity of the main problems arising in the 
framework of abduction with penalization from logic programs, both in the general case 
and when some syntactical restrictions are placed on logic programs. 



5.7 Preliminaries on Complexity Theory 

For NP-completeness and complexity theory, the reader is referred to ( jPapadimitriou 1994) . 
The classes I]^,n^ and of the Polynomial Hierarchy (PH) (cf. jStockmeyer 1987) ) 
are defined as follows: 

= = = P and for all fc > 1, 

Af = P^^-i , = NP^^-i , co-Ef . 

In particular, NP = Sf , co-NP Hf , and = P^^. Here P'^ and NP'^ denote the 
classes of problems that are solvable in polynomial time on a deterministic (resp. nonde- 
terministic) Turing machine with an oracle for any problem tt in the class C. The oracle 
replies to a query in unit time, and thus, roughly speaking, models a call to a subroutine 
for TT that is evaluated in unit time. The class contains all problems that consist of the 
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conjunction of two (independent) problems from and 11^, respectively. In particular, 
D2 is the class of problems that are the conjunction of an NP and a co-NP problem. 
Notice that for all fc > 1, 

C C Af+, C C PSPACE, 

where each inclusion is widely conjectured to be strict. 

We are also interested in the complexity of computing solutions, and thus in classes of 
functions. In particular, we consider the class FP^^, which is the class of functions corre- 
sponding to P'^^ (A^), and characterizing the complexity of many relevant optimization 
problems, such as the TSP problem ( |Papadinutriou 1984|papadimitriou 1994^ . Formally, 
this is the class of all functions that can be computed by a polynomial-time deterministic 
Turing transducer with an oracle in NP. Note that the only difference with the correspond- 
ing class of decision problems is that deterministic Turing transducers are equipped with 
an output tape, for writing the result of the computation. 



5.2 Complexity Results 

Throughout this section, we consider problems V = {H, P, 0, 7) such that P is a ground 
program, unless stated otherwise. 

Let $ — {Ci, . . . ,Cn} be a CNF propositional formula over variables Xi, . . . , Xr, 
denoted by var{^). With each Xi G var{^), 1 < i < ?% we associate two atoms 
Xi,Xi (denoted by lowercase characters), and an auxiliary atom assigned^, representing 
the propositional variable Xi, its negation not Xi, and the fact that some truth value has 
been assigned to it, respectively. Moreover, with each clause C : V ■ • ■ V in 'f, we 
associate a rule r{C) : contr ■.—negate{£i), . . . , negate{£m), where negate{£) = x, if 
£ = X, and negate{£) — x,if £ = not X. 

Define -P($) as the constraint-free positive program containing the following rules: 

r(C,). 1 < i < n 

inconsistent :—Xj,Xj. 1 < i < 

assigned j '-—Xj. 1 < i < 

assigned j '-—Xj. 1 < i < 
allAssigned :~ assigned i, . . . , assigned^. 

Let R be any set of rules whose heads are from [Si^i{xi,Xi}. Note that, for any stable 
model M of -P($) U R, allAssigned e M and inconsistent ^ M hold if and only if, 
for each X e var{^), exactly one atom from {x, x} belongs to M. That is, M encodes a 
truth-value assignment for $. Moreover, contr ^ M only if such a truth-value assignment 
satisfies all clauses of the formula $. In this case, we say that $ is satisfied by AI. 

On the other hand, given any truth-value assignment T : var(^) {true, false}, we 
denote by at{T) the set of atoms {x \ X & var{<^) andT(X) = true} U | X e 
var{<^) and T(X) = false}. It can be verified easily that, if T satisfies $, then P($) U 
facts {at (T)) has a unique stable model that contains allAssigned and contains neither 
contr nor inconsistent. 

The first problem we analyze is the consistency problem. That is the problem of deciding 
whether a PAP has some solution. 
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Theorem 5. 1 

Deciding whether a PAP V = {H, P,0,^) is consistent is NP-complete. Hardness holds 
even if P is a constraint-free positive program. 

Proof 

(Membership). We guess a set of hypotheses S C H and a set of ground atoms M, and 
then check that (i) M is a stable model of P U facts {S), and (ii) O is true w.r.t. M. Both 
these tasks are clearly feasible in polynomial time, and thus the problem is in NP. 

(Hardness). We reduce SAT to the consistency problem. Let $ be a CNF formula and 
P($) its corresponding logic program, as described above. Consider the PAP problem 
(if, P($), 0, 7), where ff = {x,x \ X G var{^)}, O = {not contr, not inconsistent, 
allAssigned}, and 7 is the constant function 0. 

Let S be an admissible solution for P, that is, there is a stable model M for P{^) 
such that allAssigned belongs to M, and neither contr nor inconsistent belongs to M. As 
observed above, this entails that $ is satisfied by the truth-assignment corresponding to 
M, and in fact encoded by the set of hypotheses S. Moreover, if $ is satisfiable, there is 
a truth-assignment T that satisfies it. Then, it is easy to check that at{T) is an admissible 
solution for P, since the unique stable model of P($) U facts {at (T)) contains allAssigned 
and no atom in {contr, inconsistent}. Thus, $ is satisfiable if and only if P is consistent. 
Finally, note that P can be computed in polynomial time from and that P does not 
contain negation or strong constraints. □ 

We next focus on the problem of checking whether a given set of atoms S is an ad- 
missible solution for a PAP V = {H, P, 0, 7). Observe that this task is clearly feasible 
in polynomial time if P is stratified, because in this case the (unique) stable model of 
P U facts{S) (if any, remember that strong constraints may occur in P) can be computed 
in polynomial time. It follows that this problem is easier than the consistency problem in 
this restricted setting. However, we next show that it remains NP-complete, in the general 
case. 

Theorem 5.2 

Deciding whether a set of atoms is an admissible solution for a PAP is NP-complete. 
Proof 

(Membership). Let V = {H, P, 0, 7) be a PAP and S a set of atoms. We guess a set of 
ground atoms M, and then check that (i) M is a stable model of P U facts (S), and (ii) 
O is true w.r.t. M. Both these tasks are clearly feasible in polynomial time, and thus the 
problem is in NP. 

(Hardness). We reduce SAT to the admissible solution problem. Let <1> be a CNF formula 
over variables {Xi ,Xr}, and P($) its corresponding logic program. Consider the PAP 
problem P = (0,P($) U G($),0,7), where O = {not contr, not inconsistent}, 7 is 
the constant function 0, and contains two rules x -.—not x and x -.—not x, for each 
X e war($). 

Let M be a stable model of P($) U Because of the rules in for each 
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pair of atoms x, x occurring in it, either a; or a; belongs M, and hence allAssigned, too. 
Thus, these atoms encode a truth-assignment T for $. Moreover, it is easy to check that 
contr, inconsistent ^ M only if this assignment T satisfies On the other hand, let T' 
be a satisfying truth-assignment for $, and let M' = at{T') U allAssigned U {assigned j \ 
1 < j '"}• Then, M' is a stable model of P($), and contr ^ inconsistent ^ M' , that is, 
all observations are true w.r.t. M' . 

Therefore, is an admissible solution for P if and only if $ is satisfiable. Note that 
unstratified negation occurs in G($). □ 

It turns out that deciding whether a solution is optimal is both NP-hard and co-NP- 
hard. However, this problem is not much more difficult than problems in these classes, as 
we need to solve just an NP and a co-NP -problem, independent of each other. 

Theorem 5.3 

Deciding whether a set of atoms is an optimal solution for a PAP is Df -complete. 
Proof 

(Membership). Let V = {H, P, 0, 7) be a PAP and let 5 be a set of atoms. To prove 
that S is an optimal solution for V first check that S is an admissible solution, and then 
check there is no better admissible solution. The former task is feasible in NP, by Theorem 
15.21 The latter is feasible in co-NP. Indeed, to prove that there is an admissible solution 
better than S, we guess a set of atoms S' C H and a model M for P, and then check in 
polynomial time that sum^{S') < sum^{S), M is a stable model of P U facts{S'), and 
O is true w.r.t. M. 

(Hardness). Let $1 and $2 be two CNF formulas, over disjoint sets of variables {Xi , . . . , Xr} 
and {X[, . . . , X!^}. Deciding whether $1 is satisfiable and $2 is not satisfiable is a Un- 
complete problem ( Papadimitriou and Yannakakis 1984} . Let P{^i) be the logic program 
associated with $1, and Gs{^i) a set of rules that contains, for each x G t;ar($i), two 
rules X : —not x, s and x : —not x, s. Let i-"($2) be the logic program associated with 
$2, but for the atoms contr, inconsistent, and allAssigned, which are uniformly replaced 
in this program by contr', inconsistent' , and allAssigned' , respectively. Moreover, let 
R be the set containing two rules ok : —not contr, not inconsistent, allAssigned and 
ok : —not contr', not inconsistent' , allAssigned' . Then, define P($i,$2) as the PAP 
problem {H,P,0,j), where P ^ P($i) U G^($i) U ^'($2) U R, H ^ {s} U {x',x' \ 
X' G var{^2)}, O — {ok}, and the penalty function 7 is defined as follows: 7(5) — 1 
and 7(/i) = 0, for any other hypothesis h e H — {s}. 

We claim that $1 is satisfiable and $2 is not satisfiable if and only if {s} is an optimal 
solution for P($i, $2)- 

(Only if). Assume that $1 is satisfiable and $2 is not satisfiable, and let Ti be a satisfying 
truth-value assignment for $1. Moreover, let M = {at{Ti) U {assigned j | 1 < j < 
r} U {s, allAssigned, ok}. Then, M is a stable model of PlJfacts{{s}) and thus {s} is an 
admissible solution for P($i, $2), and its cost is 1, as 7(5) = 1, by definition. Note that 
the only way to reduce the cost to is by finding a set of hypotheses that do not contain s, 
and is able to derive the observation ok. From the rules in R, this means that we have to 
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find a subset of {x', x' \ X' ^ var{^2)}, which encodes a satisfying truth assignment for 
4>2. However, this is impossible, because $2 is not satisfiable, and thus {s} is optimal. 

(If). Assume that {s} is an optimal solution for P(<i>i, ^2)- Its cost is 1, because 7(3) — 
1. Note that any set of hypotheses S' that encodes a satisfying truth-value assignment 
for <i>2 and does not contain s is an admissible solution for $2), and has cost 0. 

It follows that $2 is not satisfiable, as we assumed {s} is an optimal solution. There- 
fore, by definition of R, the only way to derive the atom ok is through the rule ok : 
—not contr, not inconsistent, allAssigned. Since {s} is also an admissible solution, we 
conclude that there is a stable model M that contains allAssigned, and no atom from 
{inconsistent, contr}. That is, M encodes a satisfying truth assignment for $1. □ 

If unstratified negation does not occur in logic programs, we lose a source of complexity, 
as checking whether a solution is admissible is easy. In fact, we show below that, in this 
case, the optimality problem becomes co-NP-complete. 

Theorem 5.4 

Let V — {H, P, 0, 7) be a PAP, where P is a stratified program. Deciding whether a set 
of atoms S is an optimal solution for V is co-NP-complete. Hardness holds even if P is a 
constraint-free positive program. 

Proof 

{Membership). Recall that checking whether a solution S is admissible is feasible in poly- 
nomial time if P is stratified. Thus, we have to check only that there is no admissible 
solution better than S, and this task is in co-NP, as shown in the proof of Theorem l5.3l 

{Hardness). Let <!> be a CNF formula, P(<i>) its corresponding logic program, and R be 
the set containing two rules ok : —s and ok : — allAssigned. Then, define P{^) as the 
/MP problem {H,P,0,j), where P = P($) U R, H ^ {s} U {x,x \ X e mr($)}, 
O — {ok, not contr, not inconsistent}, and the penalty function 7 is defined as follows: 
7(5) = 1 and 7(/i) = 0, for any other hypothesis h £ H — {s}. 

We claim that $ is not satisfiable if and only if {s} is an optimal solution for P{^). 

( Only if). Assume is not satisfiable. Then, there is no way of choosing a set of hypothe- 
ses that contains neither contr nor inconsistent and, furthermore, contains allAssigned and 
hence ok, but not s. It follows that the minimum cost for admissible solutions is 1. More- 
over, note that {s} is an admissible solution for P($), its cost is 1, and thus it is also 
optimal. 

(If). Let {s} be an optimal solution for P($) and assume, by contradiction, that $ is 
satisfiable. Then there is a set of hypotheses S C H — {s} that encodes a satisfying truth- 
value assignment for $ and has cost 0. However, this contradicts the fact that the solution 
{s}, which has cost 1, is optimal. □ 

We next determine the complexity of deciding the relevance of an hypothesis. 

Theorem 5.5 

Deciding whether an hypothesis is relevant for a PAP V = {H, P, 0, 7) is Al'-complete. 
Hardness holds even if P is a constraint-free positive program. 
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Proof 

(Membership). Let V = {H, P, 0, 7) be a PAP and let h G H he a hypothesis. First 
we compute the maximum value max that the function sum^ may return over all sets 
H' C H. Note that max is polynomial-time computable from V, because 7 is a polynomial- 
time computable function. It follows that its size \max\ = log max is OdT'j'^), for some 
constant fc > 0, because the output size of a polynomial-time computable function is 
polynomiaUy-bounded, as well. 

Then, by a binary search on [0, max], we compute the cost c of the optimal solutions 
for V: at each step of this search, we are given a threshold s and we call an NP oracle to 
know whether there exists an admissible solution below s. After, log max steps at most, 
this procedure ends, and we get the value c. Finally, we ask another NP oracle whether 
there exists an admissible solution containing h and whose cost is c. Note that the number 
of steps and hence the number of oracle calls is polynomial in the input size, and thus 
deciding whether h is relevant is in A2 ■ 

(Hardness). We reduce the Al'-complete problem of deciding whether a TSP instance 
/ has a unique optimal tour jPapadimitriou 1984) to the relevance problem for the PAP 
V = {H, ^,P,0), defined below, whose optimal solutions encode, intuitively, pairs of 
optimal tours. The set of hypotheses is — {c{i,j),c'{i,j) \ 1 < i,j < n}\j{heq,hdiff}, 
where c{i, j) (resp., c'{i,j)) says that the salesman visits city j immediately after city i, 
according to the tour encoded by the atoms with predicate c (resp., c'). Moreover, the 
special atoms h^q and hdiff encode the hypotheses that such a pair of tours represents in 
fact a unique optimal tour, or two distinct tours. 

For each pair of cities Ci,Cj, the penalty function 7 encodes the cost function w of 
travelling from q to Cj, that is, '^{c{i,j)) = w{i,j) and 7(c'(i, j)) — w{i,i). Moreover, 
for the special atoms, define ^{h^q) = 1 and "fihdiff) = 0.5. 

The program P, shown below, is similar to the TSP encoding described in Section lTjl 



(1) 


visited(I) 


— visited{J), c{J, I). 


(2) 


visited{l) 


-c(J,l). 


(3) 


badTour 


-c{I,J),c{I,K),J^K. 


(4) 


badTour 


-c{J,I),c{K,I),Jj^K. 


(1') 


visited' {I) 


- visited' { J), c' {J, I). 


(2') 


visited' {1) 


-c'(J,l). 


(3') 


badTour 


-c'{I,J),c'{I,K),J^K. 


(4') 


badTour 


~c'{J,I),c'iK,I),J^K. 


(5) 


diff 


~c{I,J),c'{I,K),Jy^K. 


(6) 


ok 




C7) 


ok 


~ hdtff, diff. 



The observations are O = {ok, not badTour^ U {visited{i), visited' [i) |< i < n). 

Note that every admissible solution S for V encodes two legal tours for /, through 
atoms with predicates c and c'. Moreover, S contains either h^q or hdijj, in order to derive 
the observation ok. Furthermore, if S is optimal, then at most one of these special atoms 
belongs to S, because one is sufficient to get ok. However, if the chosen atom is hdiff, ok 
is derivable only if diff is true, i.e., the two encoded tours are different, by rule (5). 
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Let tmin be the cost of an optimal tour of /. Then, the best admissible solution 5* such 
that heq £ S has cost 2t,„i„ + 1, because it should contain the hypotheses encoding two 
(possibly identical) optimal tours of /, and the atom hf.q. 

We show that there is a unique optimal tour for / if and only if h^q is a relevant hypoth- 
esis for V. 

{Only if). Let T be the unique optimal tour T for /, and S the admissible solution for V 
such that heq G S and both the atoms with predicate c and those with predicate c' encode 
the tour T. Then, S is an optimal solution, because any admissible solution S' that does 
not contain h^q should contain both hdig and diff . Since T is the unique optimal tour, any 
otherlegal tour T' has cost i„jj,j + l, at least. Hence, sum^{S') > imMi + (^?mn + l)+0.5 > 
sum^{S). Thus, h^q is relevant for V, because belongs to the optimal solution S. 

(If). If heq is relevant for V, there is an optimal solution S such that h^q G S. Recall that 
sum^{S) — 2t„iin + 1- Assume by contradiction that there are two distinct optimal tours 
T and T' for /, and let S' be an admissible solution such that: its atoms with predicates 
c and c' encode the distinct tours T and T', and both diff and hdijj belong to S'. Then, 
sumj{S') ~ 2t,nin + 0.5 < sumj{S), a contradiction. 

Finally, note that P is constraint-free positive program, and both P and its ground in- 
stantiation can be computed in polynomial time from the instance /. □ 

Not surprisingly, the necessity problem has the same complexity as the relevance prob- 
lem. 

Theorem 5.6 

Deciding whether an hypothesis is necessary for a PAP V — {H, P, 0, 7) is -complete. 
Hardness holds even if P is a constraint-free positive program. 

Proof 

(Membership). Let V = {H, P, 0, 7) be a PAP and let /i G i? be a hypothesis. We compute 
the cost c of the optimal solutions for P, as shown in the proof of Theorem l5.5l Finally, we 
ask an NP oracle whether there exists an admissible solution whose cost is c, and does not 
contain h. If the answer is no, then his a necessary hypothesis. Clearly, even in this case, 
a polynomial number of calls to NP oracle suffices, and thus the problem is in A2 . 

(Hardness). Let / be a TSP instance and P the PAP defined in the proof of Theorem l5.5l 
Note that the same reasoning as in the above proof shows that / has a unique optimal tour 
if and only if heq is a necessary hypothesis for P. □ 

Theorem 5. 7 

Computing an optimal solution for a PAP P — {H, P, 0, 7) is FP'^^-complete. Hardness 
holds even if P is a constraint-free positive program. 

Proof 

(Membership). Let M be a deterministic Turing transducer AI with oracles in NP that 
act as follows. First, M checks in NP whether P is consistent, as shown in the proof 
of Theorem 15.11 If this not the case, then M halts and writes on its output tape some 
special symbol encoding the fact that P is inconsistent. Otherwise, M computes with a 
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polynomial number of steps the value c of the optimal solutions for V, as shown in the 
proof of Theorem l5.5l Now, consider the following oracle O: given a set of hypotheses S, 
decide whether there is an admissible solution for V whose cost is c. It is easy to see that O 
is in NP (we describe a very similar proof in the membership part of the proof of Theorem 

The transducer AI maintains in its worktape (the encoding of) a set of hypotheses S, 
which is initialized with 0. Then, for each hypotheses h G H, M calls the oracle O with 
input 5* U {h}. If the answer is yes, then M writes h on the output tape and adds h to the 
set S. Otherwise, S is not changed, and M proceeds with the next candidate hypothesis. It 
follows that, after \H\ of these steps, the output tape encodes an optimal solution of V. 

(Hardness). Immediately follows from our encoding of the TSP problem shown in Section 
14.11 and the fact that this problem is FP'^^ -complete ( |Papadimitriou 1994t . □ 



6 Implementation Issues 

In this section, we describe the implementation of a system supporting our formal model 
of abduction with penalties over logic programs. The system has been implemented as a 
front-end for the DLV system. Our implementation is based on a translation from such 
abduction problems to logic programs with weak constraints (.Buccafurri et al. 2000) . that 
we show to be both sound and complete. We next describe the architecture of the prototype. 
We then briefly recall Logic Programming with Weak Constraints (the target language of 
our translation), define precisely our translation algorithm and prove its correctness. 



6.1 Architecture 

Figure|5lshows the architecture of the new abduction front-end for the DLV system, which 
implements the framework of abduction with penalization from logic programs, and is al- 
ready incorporated in the current release of DLV (available at the DLV homepage |www . dlvsystem . com| l. 




Abduction 
Rewriting 



DLV System 



Stable IVIodels 
To 

Abductive Solutions 




Fig. 5. System Architecture 



A problem of abduction in DLV consists of three separate files encoding the hypotheses, 
the observations, and the logic program. The first two files have extensions . hyp and 
. obs, respectively, while no special extension is required for the logic-program file. The 
abduction with penalization front-end is enabled through the option -FDmincost. In this 
case, from the three files above, the Abduction-Rewriting module builds a logic program 
with weak constraints, and run DLV for computing a best model M of this logic program. 
Then, the Stable-Models-to-Abductive-Solutions module extracts an optimal solution from 
the model AI. 
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For instance, consider the network problem in Example 13.21 and assume that the facts 
encoding the hypotheses are stored in the file network . hyp , the facts encoding the ob- 
servations are stored in the file network . obs, and the logic program is stored in the file 
netwok . dl. Then, the user may obtain an optimal solution for this problem by running: 

dlv -FDmincost network. dl network. hyp network. obs 

By adding option -wet race the system prints also the (possibly not optimal) solutions 
that are found during the computation. This option is useful to provide some solution to the 
user as soon as possible. Note that the "quality" of the solutions increases monotonically 
(i.e., the cost decreases), and the system gradually converges to optimal solutions. 

Note that the current release deals with integer penalties only; however, it can be ex- 
tended easily to real penalties. 

6.2 Logic Programming with Weak Constraints 

We first provide an informal description of the LP™ language by examples, and we then 
supply a formal definition of the syntax and semantics of LP"". 

6.2.1 LP"' by Examples 

Consider the problem SCHEDULING, consisting in the scheduling of course examina- 
tions. We want to assign course exams to time slots in such a way that no couple of ex- 
ams are assigned to the same time slot if the corresponding courses have some student in 
common - we call such courses "incompatible". Supposing that there are three time slots 
available, tsi, tS2 and tsa, we express the problem in LP™ by the following program 

Psch' 

ri : assign(X, tsi) ;— course(X), not assign(X, tS2), not assign(X, tsa). 

r2 : assign(X, tS2) :— course(X), not assign(X, tsi), not assign(X, tss). 

r3 : assign(X, tsa) :— course(X), not assign(X, tsi), not assign(X, tS2). 

Si : ;— assign(X, S), assign(Y, S), cominonStudents(X, Y, N). 

Here we assumed that the courses and the pair of courses with common students are 
specified by input facts with predicate course and commonStudents, respectively. In par- 
ticular, commonSudents(a, b, k) means that there are k > students who should attend 
both course a and course b. Rules ri, r2 and say that each course is assigned to one of the 
three time slots tsi, tS2 or tsa; the strong constraint si expresses that no two courses with 
some student in common can be assigned to the same time slot. In general, the presence of 
strong constraints modifies the semantics of a program by discarding all models which do 
not satisfy some of them. Clearly, it may happen that no model satisfies all constraints. For 
instance, in a specific instance of above problem, there could be no way to assign courses 
to time slots without having some overlapping between incompatible courses. In this case, 
the problem does not admit any solution. However, in real life, one is often satisfied with 
an approximate solution, in which constraints are satisfied as much as possible. In this 
light, the problem at hand can be restated as follows (APPROX SCHEDULING): "assign 
courses to time slots trying to avoid overlapping courses having students in common." In 
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order to express this problem we introduce the notion of weak constraint, as shown by the 
following program Pa_sch'- 

ri : assign(X, tsi) ;— course(X), not assign(X, tS2)not assign(X, tss). 

r2 : assigii(X, tS2) :— course(X), not assign(X, tSi)not assign(X, tss). 

ra : assign(X, tSs) :— course(X). not assign(X, tSi)not assign(X, tS2). 

wi : :^ assign(X, S), assign(Y, S), commonStud.ents(X, Y, N). 

From a syntactical point of view, a weak constraint is like a strong one where the impli- 
cation symbol : — is replaced by :~. The semantics of weak constraints minimizes the 
number of violated instances of constraints. An informal reading of the above weak con- 
straint wi is: "preferably, do not assign the courses X and Y to the same time slot if they are 
incompatible". Note that the above two programs Pgch and Pa.sch have exactly the same 
preferred models if all incompatible courses can be assigned to different time slots (i.e., if 
the problem admits an "exact" solution). 

In general, the informal meaning of a weak constraint, say, :~ B., is "try to falsify B" or 
"B is preferably false", etc. Weak constraints are very powerful for capturing the concept 
of "preference" in commonsense reasoning. 

Since preferences may have, in real life, different "importance", weak constraints in 
LP™ can be supplied with different weights, as well.'* For instance, consider the course 
scheduling problem: if overlapping is unavoidable, it would be useful to schedule courses 
by trying to reduce the overlapping "as much as possible", i.e. the number of students 
having some courses in common should be minimized. We can formally represent this 
problem (SCHEDULING WITH WEIGHTS) by the following program P^.sc/i: 

ri : assign(X, tsi) ;— course(X), not assign(X, tS2)not assign(X, tss). 

r2 : assign(X, tS2) :— course(X), not assign(X, tSi)not assign(X, tSs). 

rs : assign(X, tSs) :— course(X), not assign(X, tsi)not assign(X, tS2). 

Wi : :~ assign(X, S), assign(Y, S), commonStudents(X, Y,N). [N :] 

The preferred models (called best models) of the above program are the assignments of 
courses to time slots that minimize the total number of "lost" lectures. 

6.2.2 Syntax and Semantics 
A weak constraint has the form 

:~Li,--- ,Lm. [w :] 

where each Li, 1 < i < m, is a literal and w is a term that represents the weight.^ In 
a ground (or instantiated) weak constraint, w is a normegative integer. If the weight w is 
omitted, then its value is 1, by default. 

* Note that weights are meaningless for strong constraints, since all of them must be satisfied. 

^ In their general form, weak constraints are labelled by pairs [w : ^] , where w is a weight and ^ is a priority level. 

However, in this paper we are not interested in priorities and we thus describe a simplified setting, where we 

only deal with weights. 
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An LP '" program P is a finite set of rules and constraints (strong and weak). If P does 
not contain weak constraints, it is called a normal logic program. 

Informally, the semantics of an LP*" program P is given by the stable models of the set 
of the rules of P satisfying all strong constraints and minimizing the sum of weights of 
violated weak constraints. 

Let R, S, and W be the set of ground instances of rules, strong constraints, and weak 
constraints of an LP"" program P, respectively. A candidate model of P is a stable model 
of R which satisfies all strong constraints in S. A weak constraint c is satisfied in / if some 
literal of c is false w.rt. /. 

We are interested in those candidate models that minimize the sum of weights of vio- 
lated weak constraints. More precisely, given a candidate model AI and a program P, we 
introduce an objective function H-piM), defined as: 

Hv{M) = ^ weight (c) 

cG Violated^j 

where Violated^ j = { c G W \ cisa weak constraint violated by M } and weight (c) 
denotes the weight of the weak constraint c. A candidate model M of P is a best model of 
P if H-p{AI) is the minimum over all candidate models of P. 
As an example, consider the following program Pg'. 

a:— c,notb. :^ a, c. [1:] 

- b. [2 :] 
b ;— c,not a. :^ a. [1 :] 

- b,c. [1:] 

The stable models for the set { c. a :— c, not b. b :— c, not a.} of ground rules of this 
example are 7i-p^(Mi) = {a, c} and TLvA^h) = {^j c}, they are also the candidate 
models, since there is no strong constraint. In this case, Ti-p^ (^^i) ~ 2, and Ti-p^ {M2) — 3. 
So Ml is preferred over M2 {Mi is a best model of Pg)- 

6.3 From Abduction with Penalization to Logic Programming with Weak Constraints 

Our implementation of abduction from logic programs with penalization is based on the 
algorithm shown in Figure |6l which transforms a PAP V into a logic program LP*" (P) 
whose stable models correspond one-to-one to abductive solutions of V . 
We illustrate this algorithm by an example. 

Example 6.1 

Consider again the Network Diagnosis problem described in Example 13. 21 The translation 
algorithm constructs an LP"" program Q. First, Q is initialized with the logic program 
P. Therefore, after Step 1, Q consists of the set of facts encoding the network and of the 
following rules: 

reaches(X, X) :— node(X), not offline(X). 

reaches(X, Z) :— reaches(X, Y), connected(Y, Z), not offline(Z). 
Then, in the loop 3-5, the following groups of rules and weak constraints are added to Q. 
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Input: A PAP V={H, P, 0, 7) . 

Output: A logic program with weak constraints LP"" {V). 

Function AbductionToLP'^CP : PAP) : LP"" 
var i, j: Integer; 

Q:LP"; 
begin 

(1) Q:=P; 

(2) LetH = {hi,...,h„); 

(3) for i := 1 to n do 

(4) add to Q the following three clauses 
(4.a) hi :-_sol(i) . 

(4.b) _sol(i) :-not _iisol(i) . 

_nsol(i) :— not _sol(i) . 
(4.C) :~hi [7(hi):]. 

(5) endJbr 

(6) L&tO ^ {0i,...,0m}; 

(7) for j := 1 torn do 

(8) if Oj is a positive literal "a" 

(9) then add to Q the constraint : — ^ a . 

(10) else (* Oj is a negative literal "-1 a" *) 

(11) add to Q the constraint :— a . 

(12) endJor 

(13) return Q; 
end 

Fig. 6. Translating a PAP V into a logic program LP™ {V) 

Input: A stable model M of LP™ {V), where V is {H, P, 0, 7). 
Output: A solution of V. 

Function ModelToAbductiveSolution(M : AtomsSet): AtomsSet 

var S : AtomsSet; 

begin 

return H nM; 

end 



Fig. 7. Extracting a solution of V from a stable model of LP'^ (V) 



At Step 4.a: 

offline(a) :-_sol(l). offline(b) :-.sol(2). ••• offline(f) :-.sol(6). 



At Step 4.b: 



_sol(l) :-not _nsol(l). _sol(2) :-not _nsol(2). ••• _sol(6) :-not _nsol(6). 
_nsol(l) :— not _sol(l). _nsol(2) :— not _sol(2). ••• _nsol(6) :— not _sol(6). 



At Step 4.c: 



:~ offline(a). [7(0! f line(a)) :] 
:~ off line(b). [7(of f line(b)) :] 



:~ of f line(f ). [7(of f line(f )) :] 
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The above rules select a set of hypotheses as a candidate solution, and the weak con- 
straints are weighted according to the hypotheses penalties. Thus, weak constraints allow 
us to compute the abductive solutions minimizing the sum of the hypotheses penalties, that 
is, the optimal solutions. 

Finally, to take into account the observations, the following constraints are added to Q 
in the loop 7-12: 

:— not offline(a). 
:— not offline(e). 
not reaches(a, e). 

This group of (strong) constraints is added to Q in order to discard stable models that do 
not entail the observations. 

Note that, since in this example all observations are positive literals. Step 11 is never 
executed. 

The logic program LP"" (7-*) computed by this algorithm is then evaluated by the DLV 
kernel, which computes its stable models. For each model M found by the kernel, the 
ModelToAbductiveSolution function (shown in Figure is called in order to extract the 
abductive solution corresponding to M. 

The next theorem states that our strategy is sound and complete. For the sake of presen- 
tation, its proof is reported in Appendix |Appendix A| 

Theorem 6.2 

{Soundness) For each best model M of LP"" {V), there exists an optimal solution A for V 
such that Mr\H = A. 

{Completeness) For each optimal solution A of V, there exists a best model M of LP"" [V) 
such that Mr\H = A. 

7 Related Work 

Our work is evidently related to previous studies on semantic and knowledge representation 
aspects of abduction over logic programs ( Kakas and Mancarella 1990bllLifschitz and Turner 19941 
IKakas et al. 2000l [penecker and De Schre ye 1998|ILin and You 2002I I. that faced the main 
issues concerning this form of non-monotonic reasoning, including detailed discussions on 
how such a formalism may be used effectively for knowledge representation - for a nice 
survey, see ( Deneck er and Kakas 2002J . 

However, all these works concerning abduction from logic programs do not deal with 
penalties. The present paper focuses on this kind of abductive reasoning from logic pro- 
grams, and our computational complexity analysis extends and complements the previous 
studies on the complexity of abductive reasoning tasks ( Eiter and Gottlob 1995l lEiter et al. 1997> . 

The optimaUty criterion we use in this paper for identifying the best solutions (or expla- 
nations) is the minimization of the sum of the penalties associated to the chosen hypothe- 
ses. Note that this is not the only way of preferring some abductive solutions over others. In 
fact, the traditional approach, also considered in the above mentioned papers, is to look for 
minimal solutions (according to standard set-containment). From our complexity results 
and from the results presented in dEiter et al. 199 7 1, it follows that the (set) minimal expla- 
nation criterion is more expensive than the one based on penalties, from the computational 
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point of view. Moreover, this kind of weighted preferences has been recognized as a very 
important feature in real applications. Indeed, in many cases where quantitative informa- 
tion plays an important role, using penalties can be more natural than using plain atoms and 
then studying some clever program such that minimal solutions correspond to the intended 
best solutions. As a counterpart, if necessary, in the minimal-explanations framework we 
can represent some problems belonging to high complexity classes that cannot be repre- 
sented in the penalties framework. It follows that the two approaches are not comparable, 
and the choice should depend on the kind of problem we have to solve. 

Another possible variation concerns the semantics for logic programs, which should 
not be necessarily the stable model semantics. For instance, in JPereira et al. 199 ft . a 
form of hypothetical reasoning is based on the well-founded semantics. In some pro- 
posals, the semantics is naturally associated to a particular optimality criterion, as for 
dlnoue a nd Sakama 1999 ), where the authors consider prioritized programs under the pre- 
ferred answer set semantics. 

A similar optimization criterion is proposed for the logic programs with consistency- 
restoring rules (cr-rules) described in (Balduccini and Gelfond 200 3 1. Such rules may con- 
tain preferences and are used for making a give program consistent, if no answer set can 
be found. Firing some of these rules and hence deriving some atoms from their heads cor- 
responds in some way to the hypotheses selection in abductive frameworks. Indeed, the 
semantics of this language is based on a transformation of the given program with cr-rules 
into abductive programs. 

Such optimization criteria induce partial orders among solutions, while we have a total 
order, determined by the sum of penalties. We always have the minimum cost and the 
solutions with this cost constitute the equivalence class of optimal solutions. Note that 
even these frameworks are incomparable with our approach based on penalties, and which 
approach is better just depends on the application one is interested in. 

Since we provide also an implementation of the proposed framework, our paper is also 
related to previous work on abductive logic programming systems J Van Nuffelen and Kakas 20011 
IKakas et al. 200 H . More links to systems and to some interesting applications of abduction- 
based frameworks to real-world problems can be found at the web page (.Toni 200 3 ). 

We remark that we are not proposing an algorithm for solving optimizations problems. 
Rather, our approach is very general and aims at the representation of problems, even of 
optimization problems, in an easy and natural way through the combination of abduction, 
logic programming, and penalties. It is worthwhile noting that our rewriting procedure into 
logic programs with weak constraints (or similar kind of logic programs) is just a way 
for having a ready-to-use implementation of our language, by exploiting existing systems, 
such as DLV JEiter et al. 1998llLeone et al. 2002t or smodels JNiemela and Simons 19971 
ISimons et al. 2002l l. Differently, operations research is completely focused on finding so- 
lutions to optimization problems, regardless of representational issue. In this respect, it is 
worthwhile noting that, in principle, one can also use techniques borrowed from operations 
research for computing our abductive solutions (e.g., by using integer programming). 

A second point is that in the operations research field one can find algorithms specifically 
designed for solving, e.g., only TSP instances, or even only some particular TSP instances 
(IGutin and Punnen 2002t . It follows that our general approach is not in competition with 
operations research algorithms. Rather, such techniques can be exploited profitably for 
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computing abductive solutions, if we know that the programs under consideration are used 
for representing some restiicted class of problems. 

8 Conclusion 

We have defined a formal model for abduction with penalization from logic programs. 
We have shown that the proposed formalism is highly expressive and it allows to encode 
relevant problems in an elegant and natural way. We have carefully analyzed the compu- 
tational complexity of the main problems arising in this abductive framework. The com- 
plexity analysis shows an interesting property of the formaUsm: "negation comes for free" 
in most cases, that is, the addition of negation does not cause any further increase to the 
complexity of the abductive reasoning tasks (which is the same as for positive programs). 
Consequently, the user can enjoy the knowledge representation power of nonmonotonic 
negation without paying high costs in terms of computational overhead. 

We have also implemented the proposed language on top of the DLV system. The im- 
plemented system is already included in the current DLV distribution, and can be freely 
retrieved from DLV homepage |www . dlvs ystem . corn| for experiments. 

It is worthwhile noting that our system is not intended to be a specialized tool for solving 
optimization problems. Rather, it is to be seen as a general system for solving knowledge- 
based problems in a fully declarative way. The main strength of the system is its high-level 
language, which, by combining logic programming with the power of cost-based abduc- 
tion, allows us to encode many knowledge-based problems in a simple and natural way. Ev- 
idently, our system cannot compete with special purpose algorithms for, e.g., the Travelling 
Salesman Problem; but it could be used for experimenting with nonmonotonic declarative 
languages. Preliminary results of experiments on the Travelling Salesman Problem and on 
the Strategic Companies Problem (see Section 13 show that the system can solve also in- 
stances of a practical interest (with more than 100 companies for Strategic Companies and 
30 cities for Travelling Salesman). 
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Appendix A Proof of Theorem l6.2l 

In this appendix, we prove that the rewriting approach described in Section|6|is sound and 
complete. 

First, we recall an important result on the modularity property of logic programs under 
the stable model semantics, proved in (.Eiter et al. 1997> . 

Let Pi and P2 be two logic programs. We say that P2 potentially uses Pi (P2 > Pi) iff 
each predicate that occurs in some rule head of P2 does not occur in Pi . 

Moreover, given a set of atoms M and a program P, we denote by M \ p the set of all 
atoms from M that occur in P, i.e., M\p = M D Bp. 

Proposition Appendix A. 1 

(Eiter etal. 1997> Let P = Pi U P2 be a lo gic program such that P2 potentially uses Pi. 
Then, 

(i) for every M e SM{P), M\p, e SM{Pi); 

(ii) SM{P) = UMe5M(Pi) SM{P2Ufacts{M)). 

Lemma I 

Let P = Pi U P2 be a logic program such that P = Pi U P2 and P2 [> Pi . Then, for every 
M e SM{P),M\p^ is astablemodelforP2 U/acte(Af|pi n PpJ. 

Proof 

From Proposition [Appendix A.l| (ii), it follows that there exists Mi G SM{Pi) such that 
M e SM{P2 U facts (Ml)). We claim that Mi = M\p,. 

(Ml CM\pJ. Immediately follows from the fact that Mi C M, because M e SM{P2 U 
facts{Mi)). 

(Af|pi ^ Ml). Suppose by contradiction that there exists an atom a e Af such that 
a e M\p-^ but a ^ Mi. It follows that a is not defined in Pi and thus there exists some 
rule r of P2 having a in its head. However, this is impossible, as we assumed that P2 [> Pi. 
Contradiction. 
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Thus, M is a stable model of P2 U facts{M\p^). Let A = M\p^ n Bp^ and X = 
M|pj - A; whence, facts{M\p^) = facts{A) U facts{X). Note that X contains all and 
only the atoms of M not occurring in P2- Therefore, it is easy to see that M — X is a stable 
model for P2 U {facts{M\p-^) ~ facts{X)), which is equal to P2 U facts{A). Moreover, 
observe that A/ - X = Mjp^, and thus we get Mjp^ £ SM{P2 U facts{A)). □ 

For the sake of presentation, we assume hereafter a given PAP problem V = {H, P,0^^) 
is fixed, and let LP"'(7') — P iJ Phyp U Pobs be the program computed by the function 
AbductionToLP"" {V), where Phyp is the set of rules and weak constraints obtained by ap- 
plying steps (3)-(5), and Pobs is the set of strong constraints obtained by applying steps 
(7)-(12). 

Lemma 2 

For each stable model M of LP'" {V), 

(a) there exists an admissible solution A for V such that M H = A, and 

(b) sum^[A) = Hv{M). 

Proof 

(Part a). To show that Af n H is an admissible solution for V we have to prove that there 
exists a stable model A/' of P U facts {M n H) such that, Vo G O, o is true w.rt Af '. 

Let M' = M\p. Note that Af ' is the set of literals obtained from A/ by eliminating all 
the Uterals with predicate symbol sol and jnsol, i.e. M' is the set of literals without all 
atoms which were introduced by the translation algorithm. 

Note that P potentially uses Phyp- Thus, from Lemma Q M\p is a stable model for 
PUfacts{C), where C = M\p^^^ D Bp and hence C — AI D H, because only hypothesis 
atoms from Phyp occur in Pp. 

Finally, observe that each observation in O is true w.rt Af. Indeed, since M is a stable 
model for LP^{V), all the constraints contained in P\obs must be satisfied by A/. More- 
over, M and M' coincide on all atoms occurring in these constraints. Thus, all constraints 
contained in Pobs are satisfied by A/', too. 

(Part b). By construction of LP"'(7'), all weak constraints occurring in this program in- 
volve hypotheses of P. In particular, observe that any weak constraint :~ h [y{h) :] is 
violated by Af iff h belongs to Af. Since A — M n H, h belongs to A, as well, and its 
penalty is equal to the weight of the weak constraint. It follows that sum^{A) = TC-p{M). 
□ 

Lemma 3 

For each admissible solution A of V, 

(a) there exists a stable model Af of LP'" (V) such that M n H = A, and 

(b) nv{M) = sum^iA). 
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Proof 

(Part a). By Definition EH there exists a stable model M' = M" U A of P U facts {A), 
where M" n A = 0, such that, Vo e O, o is true w.r.t. M. 

Let Mh = {-Sol{i) I /i^ G A} U {jnsol{j) \ hj ^ A}. 

Moreover, let P' — P U facts {Mh U A). Note that P' can also be written as the union 
of the programs P U facts (A) and facts (Mff). Since these two programs are completely 
disjoint, i.e., the intersection of their Herbrand bases is the empty set, then the union of 
their stable models M' and Mjf, say M, is a stable model of P' . 

Now, consider the program P U Phijp, and observe that P [> Phyp, and that Mh U A 
is a stable model for Phyp- Then, by Proposition [Appendix A.l| any stable model of the 
program P' is a stable model of P U Phyp- Thus, in particular, M = M' U Mh is a stable 
model of P U Phyp- 

Moreover, it is easy to see that all constraints in Pobs are satisfied by M, and thus M G 
5'M(LP"'(P)), too. Finally, M can be written as M" U AU Mh, and hence MnH = A 
holds, by definitions of M" and Mh- 

(Part b). Let h be any hypothesis belonging to A and hence contributing to the cost of this 
solution. Note that LP'"(7-') contains the weak constraint :^ h [7(/i) :], weighted by 7(/i) 
and violated by M, as ft, G M. It follows that HviM) = sufn^f{A)- □ 

Theorem ]6.2\ 

{Soundness) For each best model M of LP™(P), there exists an optimal solution A for V 
such that MfMI^A- 

(Completeness) For each optimal solution A of V, there exists a best model M of LP"' ("P) 
such that M n H ^ A. 

Proof 

(Soundness). Let M be a best model of LP"'(P). From Lemma|2l A = M Ci H is an 
admissible solution for V, and sMm-y(yl) — Ti-p{M). It remains to show that A is optimal. 

By contradiction, assume that A is not optimal. Then, there exists an admissible solution 
A' for V such that sum^{A') < sum^{A)- By virtue of Lemma |3] we have that there 
exists a stable model M' for LP'"(P) such that M' nH = A' and H-piM') = sum-y{A')- 
However, this contradicts the hypothesis that M is a best model for LP"" (P). 

( Completeness). Let A be an optimal solution for P. By virtue of Lemma|3] there exists 
a stable model M for LP"' (P) such that MnH = Aand HviM) = sumy(^A). We have 
to show that M is a best model. 

Assume that M is not a best model. Then, there exists a stable model M' for LP"'(P) 
such that T-L-p{M') < H-piM). By Lemma|2l there exists an admissible solution A' for 
P such that M' n H = A' and sum^{A') = Uv{M'). However, this contradicts the 
hypothesis that A is an optimal solution for P. □ 

Appendix B A Logic Program for the Travelling Salesman Problem 

In this section we describe how to represent the Travelling Salesman Problem in logic 
programming. 

Suppose that the cities are encoded by a set of atoms {city(i) | 1 < i < n} and that 
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the intercity traveling costs are stored in a relation C(i, j , v) where v = 'w{i,j).ln abuse of 
notation, we simply refer to the number n of cities (which is provided by an input relation) 
by itself. 

The following program tti computes legal tours and their costs in its stable models: 



(1) 


T(I,J)Vf(I,J) 


-c(I,J,_). 


(2) 




-T(I,J),T(I,K),J#K. 


(3) 




-T(I,K),T(J,K),I^ J. 


(4) 


visited(l) 


-T(J,1). 


(5) 


visited(I) 


- T(J, I),visited(J). 


(6) 




— not visited(I), city(I). 


(7) 


P_Value(l,X) 


-T(1,J),C(1,J,X). 


(8) 


P_Value(K,X) 


- P_Value(K-l, Y), T(K, I), C(K, I, Z), X = Y + Z. 


(9) 


Cost(x) 


- P_Value(n, x). 



The first claus guesses a tour, where T(I, J) intuitively means that the 7-th stop of the 
tour is city J and f (I, J) that it's not. By the minimality of a stable model, exactly one of 
T(I, J) and f (I, J) is true in it, for each / and J such that 1 < /, J < n; in all other cases, 
both are false. 

The subsequent clauses (2)-(6) check that the guess is proper: each stop has attached 
at most one city, each city can be attached to at most one stop, and every stop must have 
attached some city. The rules (7)-(9) compute the cost of the chosen tour, which is given 
by the (unique) atom Cost(X) contained in the model. 

It holds that the stable models of tti correspond one-to-one to the legal tours. 

To reach our goal, we have to eliminate from them those which do not correspond to 
optimal tours. That is, we have to ehminate all tours T such that there exists a tour T' which 
has lower cost. This is performed by a logic program, which basically tests all choices for 
a tour T' and rules out each choice that is not a cheaper tour, which is indicated by a 
propositional atom NotCheaper. The following program, which is similar to tti, generates 
all possible choices for T': 



(1') 


T'(I, J) Vf'(I, J) 


-c(I,J,_). 


(2') 


NotCheaper 


-T'(I,J),T'(I,K),J^K. 


(3') 


NotCheaper 


-T'(l,K),T'(J,K),l7^ J. 


(4') 


NotChosen^top(l, 1) 


-T'(I,1). 


(5') 


NotChosen^top(I, J) 


-f '(I, J), NotChosen^top(I, J - 1). 


(6') 


NotCheaper 


— NotChosen_Stop(I, n). 


(7') 


cnt(l, 1) 




(8') 


cnt(K-h 1, J) 


-cnt(K, I),T'(I, J), J 7^ 1. 


(9') 


NotCheaper 


-cnt(K, l),T'(l,l),K7^n. 


(100 


P_Value'(l,X) 


-T'(1,J),C(1,J,X). 


(11') 


P_Value'(K,X) 


-P_Value'(K - 1, Y), T'(K, I), c(K, I, Z), X = Y + Z. 


(12') 


Cost'(X) 


-P_Value'(n,X). 



The predicates T', f ', P_Value' and Cost' have the role of the predicates T, f , P_Value 
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and Cost in tti. Since we do not allow negation, the test that for each stop a city has 
been chosen (rules (4)-(6) in tti) has to be implemented differently (rules (4')-(9') ). 
MotChosen_Stop(l, j) tells whether no city < J has been chosen for stop /. Thus, if 
NotChosen_Stop(l, n) is true, then no city has been chosen for stop I, and the choice for 
T' does not correspond to a legal tour. 

The minimal models of (l')-(12') which do not contain NotCheaper correspond one- 
to-one to all legal tours. By adding the following rule, each of them is eliminated which 
does not have smaller cost than the tour given by T: 

(13') NotCheaper :- Cost(X), Cost'(Y), X < Y. 

Thus, if for a legal tour T, each choice for T' leads to the derivation of NotCheaper, then 
T is an optimal tour. 
For the desired program, we add the following rules: 

(14') :- not NotCheaper. 

(15') P(Xi, . . . , Xn) :- NotCheaper. , 

for any predicate P that occurs in a rule head of (l')-(12') except NotCheaper. The first 
rule enforces that NotCheaper must be contained in the stable model; consequently, it 
must be derivable. The other rules derive the maximal extension for each predicate P if 
NotCheaper is true, which is a trivial model for (l')-(12'). In fact, it is for some given 
tour T the only model if no choice for T' leads to a tour with cost smaller than the cost of 
T; otherwise, there exists another model, which does not contain NotCheaper. 

Let 772 be the program consisting of the rules (l')-(15'). Then, it holds that the stable 
models of tt = tti U 772 on any instance of TSP correspond to the optimal tours.^ In 
particular, the optimal cost value, described by Cost(X), is contained in each stable model. 
Thus, the program tt computes on any instance of TSP under the possibility (as well as 
certainty) stable model semantics in Cost the cost of an optimal tour. 



Here, we suppose that the provided universe U of the database storing the instance is sufficiently large for 
computing the tour values. 



